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Abstract —The problem of finding network codes for gen¬ 
eral connections is inherently difficult in capacity constrained 
networks. Resource minimization for general connections with 
network coding is further complicated. Existing methods for 
identifying solutions mainly rely on highly restricted classes of 
network codes, and are almost all centralized. In this paper, we 
introduce linear network mixing coefficients for code constructions 
of general connections that generalize random linear network 
coding (RLNC) for multicast connections. For such code construc¬ 
tions, we pose the problem of cost minimization for the subgraph 
involved in the coding solution and relate this minimization to a 
path-based Constraint Satisfaction Problem (CSP) and an edge- 
based CSP. While CSPs are NP-complete in general, we present a 
path-based probabilistic distributed algorithm and an edge-based 
probabilistic distributed algorithm with almost sure convergence 
in finite time by applying Communication Free Fearning (CFF). 
Our approach allows fairly general coding across flows, guarantees 
no greater cost than routing, and shows a possible distributed 
implementation. Numerical results illustrate the performance 
Improvement of our approach over existing methods. 

Index Terms —network coding, network mixing, general con¬ 
nection, resource optimization, distributed algorithm. 

I. Introduction 

The problem of finding network codes in the case of general 
connections, where each destination can request information 
from any subset of sources, is intrinsically difficult and little 
is known about its complexity. In certain special cases, such 
as multicast connections (where destinations share all of their 
demands), it suffices to satisfy a Ford-Fulkerson type of min- 
cut max-flow constraint between all sources to every destination 
individually. For multicast connections, linear codes suffice [1], 

[2] , and lend themselves to a distributed random construction 

[3] . While linear codes have been the most widely considered 
in the literature, linear codes over finite fields may in general 
not be sufficient for general connections, as shown by [4] using 
an example from matroid theory. 

A matroidal structure for the network coding problem with 
general connections was conjectured by the late Ralf Kotter 
(private communication) but, while different aspects of this con¬ 
nection have been investigated in the literature [5]-[l 1], a proof 
remains elusive, except in special cases. Recently, the problem 
of scalar-linear coding has been shown to have a matroidal 


structure [7], [8], [12]. There exists a correspondence between 
scalar-linearly solvable networks and representable matroids 
over finite fields, which can be used to obtain some bounds on 
scalar linear network capacity [13] or the capacity regions of 
certain classes of networks [14]. More generally, the problem of 
finding the linear network coding capacity region is equivalent 
to the characterization of all linear polymatroids [9], whose 
structure was investigated in [10]. Reference [11] generalized 
the results of [15], which investigated the connection among 
index coding, network coding and matroid theory. In [16], 
polymatroids were used to produce linear code constructions. 

Progress in understanding the matroidal structure of the 
general connection problem has, however, not yet provided 
simple and useful approaches to generating explicit linear 
codes. There has been considerable investigation of restricted 
cases, such as a network with only two sources and two 
destinations, generally referred to as the two-unicast network 
[17]-[21], but thus far such investigation has yielded only 
bounds or linear solutions for restricted cases of the two-unicast 
network. It has been shown in [20] that the two-unicast problem 
is as hard as the most general network coding problem. Since 
the difficulty of coding in the case of general connections is 
in effect an interference cancellation one, approaches relying 
on interference alignment have naturally been explored [22]- 
[24]. Reference [25] investigated the enumeration, rate region 
computation and hierarchy of general multi-source multi-sink 
hyper-edge networks under network coding. 

Even when we consider simple scalar network codes, which 
have scalar coding coefficients, the problem of code con¬ 
struction for general connections remains vexing. The main 
difficulty lies in cancelling the effect of flows that are coded 
together even though they are not destined for a common 
destination. The problem of code construction is further compli¬ 
cated when we seek, for common reasons of network resource 
management, to limit fully or partially the use of links in the 
network. For convex cost functions of flows over edges in 
the graph corresponding to the network, finding a minimum- 
cost solution is known to be a convex optimization problem 
in the case of multicast connections (for continuous flows) 
[26]. However, in the case of general connections, network 
resource minimization, even when allowing only restricted code 
constructions, appears difficult. 

Among coding approaches for optimizing network use for 
general connections, we distinguish two types. The first, which 
we adopt in this paper, is that of mixing, by which we mean 
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coding together flows using random linear network coding 
(RLNC) [3], originally proposed for multicast connections. The 
principle is to code together flows as though they were part of a 
common multicast connection. In this case, no explicit coding 
coefficients are provided, and decidability is ensured with high 
probability by RLNC. For example, the mixing approaches 
in [27] and [28] are both based on mixing variables, each 
corresponding to a set of flows that can be mixed over an edge. 
Specifically, in [27], a two-step mixing approach is proposed for 
network resource minimization of general connections, where 
flow partition (mixing) and flow rate optimization are consid¬ 
ered separately. This separation imposes stronger restrictions 
on the mixing design in the first step and leads to a limitation 
on the feasibility region. Reference [28] studies the feasibility 
of more general mixing designs based on mixing variables of 
size 0{2^), where P is the number of flows. Reference [28] 
does not, however, provide an approach for obtaining a specific 
mixing design. The second type of coding approach is an 
explicit linear code construction, by which we mean providing 
specific linear coefficients over some finite field, to be applied 
to coding flows at different nodes. Often these constructions 
are simplified by restricting them to be pairwise. For example, 
in [29] and [30], simple codes over pairs of flows are proposed 
for network resource minimization of general connections. 

Some explicit linear network code construction approaches 
[29], [30] are distributed, but they allow only pairwise coding. 
The algorithms of [31] using evolutionary techniques, which 
are also explicit code constructions, are partially distributed, 
since the chromosomes can be decomposed into their local 
contributions, but require information to be fed back from 
the receivers to all the nodes in the network. In addition, the 
convergence results for evolutionary techniques are generally 
scant and do not yield prescriptive constructions. While RLNC 
for multicast connections is a distributed algorithm, most of 
the mixing approaches [27], [28] based on it have remained 
centralized. In [32], we propose new methods for constructing 
linear network codes for general connections of continuous 
flows based on mixing to minimize the total network cost. 
Flow splitting and coding over time are required to achieve the 
desired performance. The focus in [32] is to apply continuous 
optimization techniques to obtain continuous flow rates. In 
[33], we consider linear network code construction for general 
connections of integer flows based on mixing, and propose an 
edge-based probabilistic distributed algorithm to minimize the 
total network cost. This paper extends the results in [33]. 

Our contribution in this paper is to present new methods for 
constructing linear network codes in a distributed manner for 
general connections of integer flows based on mixing. 

• We introduce linear network mixing coefficients. The num¬ 
ber of mixing coefficients grows polynomially with the number 
of flows. We formally establish the relationship between linear 
network coding and mixing. 

• We formulate the minimization of the cost of the subgraph 
involved in the code construction for general connections of 
integer flows in terms of the mixing coefficients. 

• We relate our problem to a path-based Constraint Satisfac¬ 
tion Problem (CSP) and an edge-based CSP While CSPs are 


NP-complete in general, we present a path-based probabilis¬ 
tic distributed algorithm and an edge-based probabilistic dis¬ 
tributed algorithm with almost sure convergence in finite time 
by applying Communication Free Learning (CFL), a recent 
probabilistic distributed solution for CSPs [34]. The path-based 
distributed algorithm requires more local information than the 
edge-based distributed algorithm, but converges faster. 

• We show that our approach guarantees no greater cost 
than routing or the simplified mixing design in [27]. Numerical 
results also illustrate the performance improvement of our 
approach over existing methods. 

While our approach, like all other general connection code 
constructions, is generally suboptimal, it allows more flows to 
be mixed than is possible with pairwise mixing [29], [30] and 
with the separate mixing design in [27]. Moreover, in contrast 
to [27], [28], [32], our approach does not require non-scalar 
coding over time. 

II. Problem Setup and Definitions 
A. Network Model 

We consider a directed acyclic network with general con¬ 
nections.^ Let Q = {V,S) denote the directed acyclic graph, 
where V denotes the set of L = |V| nodes and E denotes the 
set of E = \E\ edges. To simplify notation, we assume there 
is only one edge from node i S V to node j S V, denoted 
as edge (i,j) S E? For each node i G V, define the set of 
incoming neighbors to be = {j : {j,i) & E} and the set of 
outgoing neighbors to be Oi = {j : {i,j) G f}. Let = |Ij| 
and Oi = \Oi\ denote the in-degree and out-degree of node 
2 G V, respectively. Assume h < D and Oi < D for all i G V, 
where 79 is a constant. 

Consider a finite field P with size F = |J^|. Let V = 
{I,-- - ,P} denote the set of P = \P\ flows of symbols in 
finite field P to be carried by the network. For each flow 
p €V, let Sp G V be its source. We consider integer flows. To 
simplify notation, we assume unit source rate (i.e., one finite 
field symbol per second).^ Let S = {si, • • • , sp} denote the set 
of P = |<S| sources. We assume different flows do not share a 
common source node and no source node has any incoming 
edges. Let T = {L,-- - jtp} denote the set of P = |T| 
terminals. Each terminal f G T demands a subset of Pt = \Vt\ 
flows Pt L P. Assume Uter'Pt = P- Let V = {Pt)teT denote 
the demands of all the terminals. We assume no terminal has 
any outgoing edges. 

As we consider integer flows, we assume unit edge capacity 
(i.e., one finite field symbol per second).'^ Let Zij G {0,1} 
denote whether edge {i,j) G ^ is in the subgraph involved in 

*The network model we considered in this paper is similar to that in [32] 
for continuous flows, but here we consider integer flows and edge capacities, 
and do not allow flow splitting and coding over time. 

^Multiple edges from node i to node j can be modeled by introducing 
multiple extra nodes, one on each edge, to transform a multigraph intro a 
graph. 

source with a positive integer source rate greater than one can be modeled 
by multiple sources, each with unit source rate. 

■^An edge with a positive integer edge capacity greater than one can be 
equivalently converted to multiple edges, each with unit edge capacity. 
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the code construction in a sense we shall make precise later.^ 
We assume a cost is incurred on an edge when information 
is transmitted through the edge and let Uij{zij) denote the 
cost function for edge We assume Uij{zij) is non¬ 

decreasing in Zij. We are interested in the problem of hnding 
linear network coding designs and minimizing the network cost 
j)£S fo'" general connections under those designs. 

B. Scalar Time-Invariant Linear Network Coding 

In linear network coding, a linear combination over T of 
the symbols in {aki G IF : k G li} from the incoming 
edges {{k,i) : k G li} can be transmitted through the 
shared edge (i,j) G £. The coefficients used to form this 
linear combination are referred to as local coding coefficients. 
Specihcally, let akij G T denote the local coding coefficient 
corresponding to edge (fc,i) G £ and edge {i,j) G £. Denote 
OL = io'kij)(k,i),{i,j)e£- Then, for linear network coding, using 
local coding coefficients, the symbol through edge {i,j) G £ 
can be expressed as 

(yij = ^ oikijCfki, {hi) i^S. ( 1 ) 

k^Xi 

This is illustrated in Fig. 1. 

Starting from the sources, we transmit source symbols {up G 
T : p GV}, and then, at intermediate nodes, we perform only 
linear operations over T on the symbols from incoming edges. 
Thus, the symbol of each edge can be expressed as a linear 
combination over IF of the source symbols {ap G F : p G V}. 
Let Cij^p G F denote the coefficient of flow p G V 'm the 
linear combination for edge (i, j) G £. This is referred to as the 
global coding coefficient of flow p GV and edge (i, j) G £. Let 
Cy = • • • , Cy^p, • • • , Cij^p) G F^ denote P coefficients 

corresponding to this linear combination for edge {i,j) G £. 
This is referred to as the global coding vector of edge {i,j) G 
£. Here, F^ represents the set of global coding vectors, the 
cardinality of which is . Then, using global coding vectors, 
the symbol through edge (i,j) G £ can also be expressed as 

(Jij = ^ ^ Cij^pCTp^ {i^j^G£^ i ^ S. (2) 

p&V 

This is illustrated in Fig. 1. 



Fig. 1 : Illustration of local and global coding coefficients. P = { 1 , 2 }. 
Then, we have Uki = Cki,i<J\ -f Cki,20-2, Ohi — Chi,iui -I- Chi,2<Z2, 
— OikijOki F OlkijeThi — -j- Cij,2rr2, — C^kijCki^l “f 

e^hijChi,! and Cij^2 ~ OLkijCki^2 “i” OLhijChi,2‘ 

^There is either no flow or a unit rate of (coded) flow through each edge. 
Under the unit source rate and edge capacity assumptions, we shall see that 
there is one global coding (mixing) vector for each edge. 


In this paper, we consider scalar time-invariant linear net¬ 
work coding. In other words, akij G F and Cij^p S -F are 
both scalars, and do not change over time. Let denote the 
vector with the p-th element being I and all the other elements 
being 0. For decodability to hold at all the terminals, the global 
coding vectors at all edges must satisfy the following feasibility 
condition for scalar linear network coding. 

Definition 1 (Feasibility of Scalar Linear Network Coding): 
For a network Q = (Vjf) and a set of flows V with sources 
S and terminals T, a linear network code c = (cij)(ij)^s is 
called feasible if the following three conditions are satisfied: 
1) Cjjpj = Bp for source edge {sp,j) G £, where Sp G S 
and p G V-, 2) = Y^keii'^kijCkz for edge {i,j) G £ 

not outgoing from a source, where i ^ S and akij G F', 3) 
Bp G span{ci( : i Git}, where p G Vt and t G T- 

Note that when using scalar linear network coding, for each 
terminal, extraneous flows are allowed to be mixed with the 
desired flows on the paths to the terminal, as the extraneous 
flows can be cancelled at intermediate nodes or at the terminal. 

C. Scalar Time-Invariant Linear Network Mixing 

As mentioned in Section I, to facilitate distributed linear 
network code designs for general connections using the mix¬ 
ing concept (without requiring the specific values of local 
or global coding coefficients in the designs), we introduce 
local and global mixing variables. Later, we shall see that 
distributed linear network mixing designs in terms of these 
mixing coefficients are much easier. Specifically, we introduce 
the local mixing coefficient (Skij G {0,1} corresponding to edge 
{k,i) G £ and edge {i,j) G £, which relates to the local coding 
coefficient Uk^j G F. Denote (3 = {l3kij){k,i),{i,j)e£- Pkzj = 1 
indicates that symbol aki of edge (fc, i) G £ is allowed (under 
our construction) to contribute to the linear combination over 
F forming symbol Oij in (1) and (3kij = 0 otherwise. Thus, 
if Pkij = 0, we have akij = 0; if Pkij = 1, we can further 
determine how symbol aki contributes to the linear combination 
forming symbol atj by choosing akij G F (note that akij can 
be zero when fdkij = !)■ 

Similarly, we introduce the global mixing coefficient Xij^p G 
{0,1} of flow p gV and edge (i, j) G £, which relates to the 
global coding coefficient Cij^p G F. Xij^p = 1 indicates that 
flow p is allowed (under our construction) to be mixed (coded) 
with other flows, i.e., symbol ap is allowed to contribute to 
the linear combination over F forming symbol aij in (2), and 
Xij^p = 0 otherwise. Thus, if Xij^p = 0, we have = 0; if 
Xij^p = 1, we can further determine how symbol ap contributes 
to the linear combination forming symbol atj (note that 
can be zero when Xij^p = 1). Then, we introduce the global 
mixing vector x^- = (xyp, • • • , x^-^p, • • • , Xij^p) G {0,1}^ for 
edge (i, j) G £, which relates to the global coding vector = 
(cijp, • • • , Cij^p, • • • , Cij^p) G F^. Here, {0,1}-^ represents the 
set of global mixing vectors, the cardinality of which is 2^. 

We consider scalar time-invariant linear network mixing. In 
other words, (3kij G {0,1} and G {0,1} are both scalars, 
and /3kij and Xij^p do not change over time. 

Global mixing vectors provide a natural way of speaking 
of flows as possibly coded or not without knowledge of the 
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specific values of global coding vectors. Intuitively, global 
mixing vectors can be regarded as a limited representation of 
global coding vectors. Given network mixing vectors, it may 
not be sufficient to tell whether a certain symbol can be decoded 
or not. Thus, using the network mixing representation, the 
extraneous flows, when mixed with the desired flows on the 
paths to each terminal, are not guaranteed to be cancelled at 
the terminal. For decodability to hold at all the terminals, the 
global mixing vectors at all edges must satisfy the following 
feasibility condition for scalar linear network mixing. 

Definition 2 (Feasibility of Scalar Linear Network Mixing): 
For a network Q = (y,£) and a set of flows V with 
sources S and terminals T, a linear network mixing design 
X = is called feasible if the following three 

conditions are satisfied: 1) = Op for source edge 

(spj) e £, where Sp G 5 and p e V; 2) Xij = WkGZiPkij^ki 
for edge {i,j) G £ not outgoing from a source, where i ^ S 
and Pkij € {0,1};'’ 3) Xit,p = 0, where i G It, p ^ Vt, t & T. 

Note that Condition 3) in Definition 2 ensures that for each 
terminal, the extraneous flows are not mixed with the desired 
flows on the paths to the terminal. In other words, linear 
mixing allows only mixing at intermediate nodes. This is not 
as general as using linear network coding, which allows mixing 
and canceling (i.e., removing one or multiple flows from a 
mixing of flows) at intermediate nodes. 

Given a feasible linear network mixing design, one of the 
ways to implement mixing when (F is large is to use random 
linear network coding (RLNC) [3], [27], as discussed in the 
introduction. In particular, when fikij = 1, ockij can be 
randomly, uniformly, and independently chosen in F using 
RLNC; when jSkij = 0, akij has to be chosen to be 0. 


III. Mixing Problem Formulation 


In this section, we formulate the problem of selecting mixing 
coefficients (3 and x to minimize the cost of the subgraph 
involved in the coding solution, i.e., the set of edges used in 
delivering the flows. 

In the following formulation, Zij G {0,1} indicates whether 
edge (f,j) G £ is involved in delivering flows, and /Lp G 
(0,1} indicates whether edge (*, j) G f is involved in deliver¬ 
ing flow p G Vt to terminal t G T- 
Problem 1 (Mixing): 


U*{T) = min 

z.f,x,/3 


t/y- {Zij ) 


s.t. Zij G {0,1}, (f,j) G £ 

Xij,p e {0,1}, {i,j) g£, pGV 
Pkij G {0,1}, {k,i),{i,j) G £ 
fij,p ^ 'L®’ C P G 'Pt, t gT 

'y fij,p — (F j) G f, t GT 

pSPt 


(3) 

(4) 

(5) 

(6) 
(7) 


fik,p ~ fki,p — ^i,p^ i G V, P G Vt, t GT 

keOi feeii 

( 8 ) 


^Note that V denotes the “or” operator (logical disjunction). 


f!j,p < Xij^p, (f, j) g£, pGVt, IgT (9) 

~ i^p^j') G p G D (10) 

Xjj — ^kGZi/3kij^ki, (^;j) G £, i ^ S (11) 

Xit,p = 0, i G It, ptj^Vt, t gT (12) 

where 

{ 1 , % — Sp 

— 1, i = t iGV, pGVt, tG'T- (13) 
0, otherwise 

Here, z i^ij) ^tid f (^fij^)p^'p^ taT- 

Consider a feasible solution z, f, x and /3 to Problem 1. By 
(6) and (8), we know that for all p G Vt and t G T, all the 
edges in {(f,j) G £ : fljp = 1} form one flow path (i.e., a set 
of ordered edges (f, j) G £ such that /L p = 1) from source Sp 
to terminal t. In addition, combining (3) and (7), we have an 
equivalent constraint purely in terms of f, i.e., 

f!,,pG{0,l}, {z,j)G£,tGT. (14) 

p€Pt 

From this, we know that for all p,p' G Vt, P p' and 
t G T, the two flow paths from sources Sp and Sp/ to terminal 
t are edge-disjoint. Finally, the feasibility constraints in (10), 
(11) and (12) together with (4) and (5) set other requirements 
on flow paths (i.e., f) via the constraint in (9). Therefore, a 
feasible solution to Problem 1 corresponds to a set of flow paths 
satisfying certain requirements, as illustrated above. These 
interpretations can be understood from the following example. 

Example 1 (Illustration of Problem 1): As illustrated in 
Fig. 2, we consider a network with V = {1,2}, S — {1,2}, 
r = {8,7,10}, Vi = {1}, V 2 = V 3 = {1,2}, and 
Uij{zij) = Zij G {0,1} for all (i,j) G £. Problem 1 for 
the network in Fig. 2 has two feasible solutions of network 
costs 11 and 12, as illustrated in Fig. 2 (a) and Fig. 2 (b), 
respectively. Specifically, the two feasible solutions share the 
same flow paths from sources si and S 2 to terminals t 2 and 
fs, i.e., flow paths 1—3 — 4 — 6 — 7, 2 — 5 — 7, 1 — 3 — 9 — 10, 
2 — 5 — 4 — 6 — 10. Notice that the two feasible solutions have 
different flow paths from source si to terminal ti, i.e., flow 
path 1 — 3 — 8 for the feasible solution in Fig. 2 (a) and flow 
path 1 — 3 — 9 — 11 — 8 for the feasible solution in Fig. 2 (b). 
The optimal solution is the one illustrated in Fig. 2 (a) and 
the optimal network cost is 11. 

We now illustrate the complexity of Problem 1. The num¬ 
ber of variables in (3 is = Y^jav^jOj < 

D number of variables in f is smaller 

than or equal to PTE. The numbers of variables in z and x 
are E and PE, respectively. Therefore, the total number of 
variables in Problem 1 is smaller than or equal to {D + 1)E + 
(T -I- 1)PE, i.e., polynomial in E, T and P. Problem 1 is a 
binary optimization problem, and does not appear to have a 
ready solution. 

^Note that the optimal value in Problem 1 is a function of Q, S, T and P, 
which are assumed to be fixed. Here, we write U* (P) as a function of P only 
to emphasize the impact of P on U*(P), which is helpful when considering 
demand set expansion in Problem 2. 
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(a) One feasible solution of network cost 11. 



(b) One feasible solution of network cost 12. 


Fig. 2: Illustration of feasible solutions to Problem 1. P = {1,2}, 
5 = {1,2}, r = {8,7,10}, Pi = {!}, P2 = Ps = {1,2}, 
Uij{zij) = Zij € {0,1} for all {i,j) € S. The flow paths from the 
two sources are illustrated using green and blue curves, respectively. 
Since P 2 = Ps = {1, 2}, the flows from si to t 2 and S 2 to fs are 
allowed to be mixed at edge (4,6). The red edges carry network- 
coded information. In this topology. Problem 1 has two feasible 
solutions. However, neither the two-step mixing approach for general 
connections in [27] nor routing provides a feasible solution. 


Remark 1 (Problem 1 for Multicast): When P* = P for 
all f G T (i.e., multicast), the constraint in (12) does not 
exist, and the constraint in (9) is always satisfied by choosing 
Pkij = 1 for all (fc,i), (*, j) G £ and choosing x accordingly 
by (10) and (11). Therefore, Problem 1 for general connections 
reduces to the conventional minimum-cost scalar time-invariant 
linear network code design problem for the multicast case. The 
complexity of the optimization for the multicast case is much 
lower than that for the general case. This is because in the 
optimization for the multicast case, variables x and /3 do not 
appear, and the constraints in (4), (5), (9), (10), (11) and (12) 
can be removed. 

In the following, we show that a feasible linear network code 
can be obtained using a feasible solution to Problem 1 (e.g., 
using RLNC [3]), as illustrated in Section II-C. 

Theorem 1: Suppose Problem 1 is feasible. Then, for each 
feasible x and /3, there exists a feasible linear network code 
design a and c with a field size F > T to deliver the desired 
flows to each terminal. 

Proof: Please refer to Appendix A. ■ 

Next, the minimum network cost of Problem 1 is no greater 



Fig. 3: Illustration of the network coding gain improvement of 
Problem 2 over Problem 1 for two unicasts over the butterfly network. 
Problem 1 (left) is not feasible, as the flows from si to t 2 and S 2 to 
ti are not allowed to be mixed over edge (3,4) when Pi n P 2 = 0. 
However, Problem 2 (right) is feasible, as the flows are allowed to be 
mixed over edge (3,4) after the demand set expansion to P. 


than the minimum costs of the two-step mixing approach for 
general connections in [27] and routing for integer flows, 
owing to the following reasons. Problem 1 with j3kij = 1 
for all G £, instead of (5), is equivalent to the 

minimum-cost flow rate control problem in the second step of 
the two-step mixing approach for general connections in [27]. 
Problem 1 with an extra constraint ^ 1} 

(i, j) G £ is equivalent to the minimum-cost routing problem. 
Fig. 2 illustrates a feasible solution to Problem 1 that cannot 
be obtained by the two-step mixing approach [27] or routing. 
In this example, the minimum network cost of Problem 1 is 
smaller than those of the two-step mixing approach [27] and 
routing (which can be treated as infinity). 

When P( nPt/ = 0 for all t and t,t' gT (e.g., multiple 
unicasts). Problem 1 for general connections reduces to the 
minimum-cost routing problem and cannot take advantage of 
the network coding gain. This is because using the network 
mixing representation, for decodability to hold, the extraneous 
flows of each terminal are not allowed to be mixed with the 
terminal’s desired flows on the path to this terminal, thus 
limiting the network coding gain. To address this limitation, 
we now formulate Problem 2, which allows the expansion of 
the demand sets and the optimization over the expansions to 
increase the opportunity for mixing flows to different terminals. 
Let Vt denote the expanded demand set, which satisfies Vt L 
Vt L V. Let P = {Vt)t^T denote the expanded demand sets 
of all the terminals. 

Problem 2 (Mixing with Demand Set Expansion): 


U* = min 

U*{V) 

(15) 

{■Pt} 


s.t. 

VtcVtFV, ter 

(16) 


where U*{T’) is the optimal value to Problem 1 for V. 

The network coding gain improvement of Problem 2 can 
be easily understood from the case of two unicasts over the 
butterfly network, as illustrated in Fig. 3. By Theorem 1, we 
can easily show the following result. 

Corollary 1: Suppose Problem 2 is feasible. Then, for each 
feasible solution, there exists a feasible linear network code 
design with a held size F > T to deliver the desired flows to 
each terminal. 
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By comparing Problem 1 and Problem 2, we can obtain the 
following lemma. 

Lemma 1: Suppose Problem 1 is feasible. Then, Problem 2 
is feasible and U* < U*{V). 

In the following sections, we focus on solving Problem 1 
for given T’. However, the obtained centralized and distributed 
algorithm can be easily extended to solve Problem 2 by further 
optimizing over V. Later, in Section VI, we shall illustrate the 
results for Problem 1 and Problem 2 numerically. 


IV. Centralized Algorithm 


In this section, we develop a centralized algorithm to solve 
Problem 1, based on the concept of edge-disjoint flow paths 
discussed before. The centralized algorithm is conducted at a 
central node which is aware of global network information. 

For all p G Vt and t G T, let N* denote the number of flow 
paths from source Sp to terminal t. For Problem 1 to be feasible, 
assume N* > 0. As illustrated in Section III, obtaining a 
feasible solution to Problem 1 is equivalent to selecting a set of 
flow paths satisfying certain requirements. Thus, we introduce 
flow path selection variables to indicate the flow paths selected 
for information transmission. Let n* G {1, • • • , Ari} denote the 
flow path selection variable for source Sp and terminal t (i.e., 
the index of the selected flow path from source Sp to terminal t). 
Denote the flow path selection variables as n = {np)p^-p^^t^'j-. 
In the following, we express variables z, f, x and (3 in terms of 
n. To satisfy (6) and (8), we require n* to take only one value 
from {1, • • • ,Np}. Let Cp{np) denote the selected flow path 
(i.e., the set of edges on the selected flow path) from source 
Sp to terminal t. To satisfy (3) and (7) (or equivalently (14)), 
we require that the Pt flow paths from sources {sp : p G Pt} 
to terminal t are edge-disjoint, i.e.. 


n Cl,{nl,) = 0, p,p' G Vt, p ^ p', t G T. (17) 

Then, variables f(n) 4 {fljMp)) 
expressed in terms of variables n as follows: 


f* (n‘) 

J lj,p\ P J 



{i,j) G Clinl) 

otherwise 


(i,j) G£, pGVt, IgT. 


(18) 


By (7) and the monotonicity of variables z(n) = 

(%(n))(, . can be chosen based on f(n) and expressed 

implicitly in terms of variables n as follows: 

Zij{n) = ^ /y-p^), (ij) G £. (19) 

pe'Pt 


Algorithm 1 Centralized Algorithm for Problem 1 

1: For all p G Vt and t G T, obtain all the flow paths {Lpjnj,) : 
Up G {1, • • • , V*}} from source Sp to terminal t, using depth- 
first-search (DFS). 

2: For all t G T, obtain the set of Pt edge- 
disjoint flow paths C* — {{£p{np))p£Vt ’■ uj, G 

{1, • ■ • , Np} for all p GVt and (17) is satisfied} from sources 
{sp : p G Pt} to terminal t. 

3: Calculate the network costs of L = Iltgr combinations of 
Pt edge-disjoint flow paths for all terminal t gP, and sort the L 
combinations in the ascending order of their network costs, where 
L* = |£*| and the 1-th combination is of the 1-th smallest network 
cost Ui. 

4: initialize 1 = 1 and flag — 1. 

5: while flag = 1 do 

6: For all p GVt and t G T, let rip denote the index of the flow 

path from source Sp to terminal t in the 1-th combination. 

7: For all p G Vt, t G T and {i,j) G £, set flj p{np) according 

to (18). 

8: For all (i,j) G £, set Zij(n) according to (19). 

9: For all {i,j) G S, set /Skijin) according to (20). 

10: Based on {j3kij{ji)}, (10) and (11), determine {a;ij,p(n)} in 

the topological order. 

11: if (21) is satisfied then 

12: let UfifP) = Ui, z* = z(n), f* = f(n)}, x* = x(n)}, 

/3* = /9(n), and set flag = 0 

13: else 

14: set 1 = 1 -I- 1 

15: end if 

16: end while 


Then, based on (10) and (11), x(n) can be determined in 

topological order® and expressed implicitly in terms of variables 
n. Finally, to satisfy (12), we require 

a;*t.p(n) =0, i Git, P ^Pt, t G T. (21) 

Based on the above relationship between the flow path selection 
variables and the variables of Problem 1, we now describe the 
procedure of the centralized algorithm, i.e.. Algorithm 1, which 
obtains the feasible flow paths of the minimum network cost. 

Note that Constraints (6) and (8) are guaranteed in Step 1 
and Step 8; Constraints (3) and (7) are guaranteed in Step 2 
and Step 7; Constraints (4), (5), (9) are guaranteed in Step 9; 
Constraints (10) and (11) are guaranteed in Step 10; and 
Constraint (12) is considered in Steps 11-15. Therefore, we 
can see that the optimization to Problem 1 can be obtained by 
Algorithm 1. 

V. Probabilistic Distributed Algorithms 
In this section, using recent results for CSP [34], we develop 
two probabilistic distributed algorithms to solve Problem 1. 


In addition, to satisfy (4), (5), (9) and (11), ,9(n) = 

Pkij t.) (i tie chosen based on f (n) and expressed 

implicitly in terms of variables n as follows: 

1, maxter.pGT’t /L.pK)/y,p(^P = 1 

0, otherwise ’ 

{k,i),{i,j) G £. (20) 


A. Background on Decentralized CSP 

We review some existing results on CSP in [34]. 

Definition 3 (Constraint Satisfaction Problem): [34] A 

CSP consists of M variables {Ai,--- ,Am} and K clauses 

topological order of a directed graph Q = {V^S) is an ordering of its 
nodes such that for every directed edge {i^j) € S from node f G V to node 
J E V, f comes before j in the ordering. Such an order exists for the edges of 
any directed graph Q that is acyclic. 
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Each variable Am takes values in a finite 
set A, i.e., Am S A for all m S A4 = {I,-- - ,Af}. Let 
A 4 (Ai,... , Am) G A“. Each clause e /C = {1, • • • , AT} 
is a function : A^ {0,1}, where for an assignment of 
variables A € A^, (t>k{^) = 1 if clause m is satisfied and 
4>k{^) = 0 otherwise. An assignment A G A^ is a solution to 
the CSP if and only if all clauses are simultaneously satisfied 

min(/)fc(A) = 1. (22) 

kGfC 

To solve a CSP in a distributed way, clause 
participation is introduced in [34]. Let A_m = 
(Ai,---,Am-i,Am+i,--- ,Am) G A^“A Eor each 
variable Am, let /Cm denote the set of clause indices 
in which it participates, i.e., /Cm = UA^,„gAM-i{c : 
minA„eA ?f>fc(Am, A_m) = 0, maxA„eA (/'fc(Am, A_m) = 1}. 
Thus, we can rewrite the left hand side of (22) in a 
way that focuses on the satisfaction of each variable, i.e., 
rniumeAT (/'fc(A) = 1. This form enables us to solve 

CSPs in a distributed iterative way by locally evaluating the 
clauses in /Cm and then updating Am. 

CSPs are in general NP-complete and most effective CSP 
solvers are designed for centralized problems. The CEL al¬ 
gorithm [34, Algorithm 1], summarized in Algorithm 2, is 
a distributed iterative algorithm which can find a satisfying 
assignment to a CSP almost surely in finite time [34, Corollary 
2]. Note that Algorithm 2 keeps a probability distribution 
over all possible values of each variable. The value of each 
variable is selected from this distribution. Eor each variable, 
if all the clauses in which a variable participates are satisfied 
with its current value, the associated probability distribution is 
updated to ensure that the variable value remains unchanged; 
if at least one clause is unsatisfied, the probability distribution 
evolves by interpolating between it and a distribution that is 
uniform on all values except the one that is currently generating 
dissatisfaction. Therefore, if all variables are simultaneously 
satisfied in all clauses, the same assignment of values will be 
reselected indefinitely with probability 1. 


Algorithm 2 Communication-Free Learning [34] 


1: Initialize gm(A) = for all A G A, where |A| denotes the 
cardinality of A. 

2: loop 

3: Realize a random variable, selecting Am = A with probability 

Qm ( A). 

4: Evaluate min^gx;™ </’/c(A), returning satisfied if its value is 1 

and unsatisfied otherwise. 

5: if satisfied then 


6: set(jm(A) 


1, if A = Am 
0, otherwise 


7: else 


8: set qm(A) = ■ 

where a, fe G (0, 

9: end if 

10: end loop 


J (1 - &)gm(A) -I- 
[(1 -6)9m(A) -b + 
1] are design parameters. 


if A = Am 

otherwise 


B. Path-based Probabilistic Distributed Algorithm 


In this part, we develop a path-based probabilistic distributed 
algorithm to solve Problem 1, using recent results in [34]. 
This distributed algorithm is based on the concept of edge- 
disjoint flow paths discussed before. It can be viewed as a 
distributed version of the path-based centralized algorithm, i.e.. 
Algorithm 1. Eor all p G "P*, f G T and G {1, • • • ,/Vp}, 
this algorithm requires each node on the n^-th flow path from 
source Sp to terminal t to know its neighboring edge on this 
flow path. Note that it is not necessary for each node on the 
rip-th flow path to be aware of other edges on the n^-th flow 
path. 

Eirst, we construct a path-based CSP corresponding to the 
feasibility problem obtained from Problem 1. Treat n as the 
variables of the path-based CSP, where n* G {1, • • • ,AP} 
denotes the index of the selected flow path from source Sp 
to terminal t. As illustrated in Section IV, the constraints of 
f in (6) and (8) for Problem 1 can be taken into account by 
choosing G {1, • • • , N^}, for all p G Vt and t & T- The 
constraints in (3) and (7) (or equivalently (14)) can be replaced 
by the constraint of n in (17) for the path-based CSP. Variables 
{/lyfe(n)} and {xiy_p(n)} can be determined for given n via 
(18), (20), (10) and (11). Thus, the last constraint in (12) of 
Problem 1 can be replaced by the constraint of n for the path- 
based CSP in (21). Therefore, we can write the clause for 
as follows; 


(n) 



if (17) and (21) hold 
otherwise 


p G Pt, f G P. 

(23) 


We thus have the following proposition.® 

Proposition 1 (Path-based CSP): The path-based CSP with 
variables n (rip G {1, • • • , N^}) and clauses (23) has consid¬ 
ered all the constraints in Problem 1. 

Now, we present a path-based distributed probabilistic al¬ 
gorithm, i.e.. Algorithm 3, to obtain a feasible solution to 
the path-based CSP using CEL [34, Algorithm 1]. Based on 
the convergence result of CEL [34, Corollary 2], we know 
that Algorithm 3 can find a feasible solution to Problem 1 
in almost surely finite time. Pig. 4 illustrates the convergence 
of Algorithm 3. Prom Pig. 4, we can see that Algorithm 3 
converges to a feasible solution (i.e., the feasible solution 
illustrated in Pig. 2 (a)) to Problem 1 for the network in 
Pig. 2 quite quickly (within 35 iterations). This feasible solution 
corresponds to flow paths 1 — 3 — 8, 1 — 3 — 4 — 6 — 7, 2 — 5 — 7, 
1 — 3 — 9 — 10 and 2 — 5 — 4 — 6 — 10. The network cost of 
this feasible solution is 11. 

Relying on Algorithm 3, we present a path-based distributed 
probabilistic algorithm. Algorithm 4, to obtain the optimal solu¬ 
tion to Problem 1 among multiple feasible solutions obtained by 
Algorithm 3.'° Since Algorithm 3 can find any feasible solution 


®Note that the clauses of the path-based CSP cannot be further partitioned, 
as all the vaiiables n are coupled in general. 

*®In Step 3 of Algorithm 4, the path-based CFL is run for a sufficiently 
long time. Step 6 of Algorithm 4 can be implemented with a master node 
obtaining the network cost of the path-based CFL from all nodes or with all 
nodes computing the average network cost of the path-based CFL locally via 
a gossip algorithm. 







s 


Algorithm 3 Path-based CFL 


1: For all p £ Vt and t £ T, obtain all the flow paths from source 
Sp to terminal t, using DFS. 

2: For all p £ Vt and t £ T, source Sp initializes Pp (n) = for 

all n € {1, Np}. 

3: loop 

4: For all p £ Vt and t £ T, source Sp realizes a random 

variable, selecting rip = n with probability qp{n), where n £ 
Ap}, and sends signaling packet {p,t,np) over edge 
(•Spi j) £ to node j. Once node j receives signaling 

packet {p,t,np) over edge {i,j) € Cp{np) , it forwards this 
signaling packet to node k over edge {j, k) £ j[lp{np). 

5: For all {i,j) £ £, if edge {i,j) receives signaling packets 

{{p,t,np) : p £ Vt} to terminal t £ T from more than one 
source in {sp : p G Pt}, it sends NAK back to each of these 
sources along its selected path rip. 

6: For all p £ Vt, t £T and {i,j) £ £, set flj p{rip) according 

to (18). 


10 : 

11 : 

12 : 

13: 

14: 


For all {k,i), {i,j) £ S, set Pkij{n) according to (20). 

Based on /3(n), (10) and (11), determine x(n) in topological 
order. 

Every terminal t £ T checks (21). For all i £lt and p 0 Vt, 
if (21) is unsatisfied, terminal t sends NAK back to source Sp 
(along any path to source Sp), and sends NAK back to each of 
the sources in Vt (along its selected path), 
for p £Vt and t £T Ao 

if source Sp receives no NAKs then 


set ijp(n) 


else 


set 


qlij. 


1, if n = rip 
0, otherwise 

(1 - b)ql{n) + ,if n = n* 

(1 - h)q*p{n) + , otherwise 


. i+f’ 

where a,b £ (0,1] are design parameters. 

end if 
end for 
end loop 


Algorithm 4 Path-based Distributed Algorithm 

1: ( = 1 and Ui = -poo. 

2: loop 

3: Run the path-based CFL in Algorithm 3 to the path-based CSP 

corresponding to Problem 1. Let n; denote the feasible solution 
obtained by Algorithm 3 and let Ui denote the corresponding 
network cost. 

4: if Ui < Ui then 

5: set Ui+i = Ui, n* = n;, and 1 = 1 + 1. 

6: end if 

7: end loop 


to Problem 1 with positive probability, Ui U*{{Vt}) almost 
surely as / —p cx), where Ui denotes the minimum network 
cost obtained by the first I path-based CFLs. Fig. 5 illustrates 
the convergence of Algorithm 4. From Fig. 5, we can see that 
Algorithm 4 obtains the optimal network cost 11 to Problem 1 
for the network in Fig. 2 quite quickly (within 5 iterations). 


C. Edge-based Probabilistic Distributed Algorithm 

In this part, we develop an edge-based probabilistic dis¬ 
tributed algorithm to solve Problem 1, using recent results in 
[34]. Compared with the path-based distributed algorithm in 
Section V-C, this edge-based distributed algorithm does not 



5 10 15 20 25 30 35 40 45 50 

Iterations 

(b) Flow path from source si to terminal t 2 . 


5 10 15 20 25 30 35 40 45 50 

Iterations 

(c) Flow path from source S 2 to terminal t 2 . 


-H- 1-3-4-6-10 

-1-3-9-10 
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Iterations 

(d) Flow path from source si to terminal 


-H- 2-5-4-6-10 


5 10 15 20 25 30 35 40 45 50 

Iterations 

(e) Flow path from source S 2 to terminal 


Fig. 4: Convergence of the path-based CFL in Algorithm 3 for 
Problem 1 of the network in Fig. 2. a = 1 and b = 0.01. These 
convergence curves are for one realization of the random Algorithm 3. 
Note that all the flow paths are shown in the figure. 


o 

o 



Iterations 


Fig. 5: Network costs of the path-based CFLs in Algorithm 4 for 
Problem 1 of the network in Fig. 2. Each blue dot represents the 
network cost of a feasible solution obtained by the path-based CEL 
in each iteration of Algorithm 4. While the red curve represents the 
minimum network cost obtained by Algorithm 4 within a certain 
number of iterations. The blue dots and red curve are for one 
realization of the random Algorithm 4. 
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require any path information. 

Obtaining a feasible solution to Problem 1 can be directly 
treated as a CSP [34]. Specifically, z, f, x, (3 and {0,1} can be 
treated as the variables and the finite set of the CSP. Constraints 
(7)-(12) can be treated as the clauses of the CSP. While CSPs 
are in general NP-complete, several centralized CSP solvers 
(see references in [34]) and the distributed CSP solver proposed 
in [34] can be applied to solve this (naive) CSP. However, the 
direct application of the distributed CSP solver in [34] leads to 
high complexity owing to the large constraint set. In this part, 
by exploring the features of the constraints in Problem 1, we 
obtain a different CSP and present a probabilistic distributed 
solution with a significantly reduced number of clauses. 

First, we construct a new problem, which we show to be 
a CSP. This new problem is better suited than the original 
problem to being treated using a probabilistic distributed al¬ 
gorithm based on the distributed CSP solver presented in [34]. 
Combining (3) and (7), we have an equivalent constraint purely 
in terms of f, i.e., (14). In addition, from (11), we have an 
equivalent constraint purely in terms of x, i.e., 

3 [3kij C {0, 1} VA; ^ S.t. 'X.ij = ^ Pkij^ki } 

{i,j) e£, i^S. (24) 


Therefore, we can solve only for variables f and x in a 
distributed way, as z can be obtained directly from feasible 
f by choosing = maxtgr Epeftaccording to (3) 
and (7), and /3 can be obtained from feasible x by (10) 
and (11). We group all the local variables for each edge 
{i,j) G £ and introduce the vector variable (fij,Xy) G 
y^j, where % ^ = {fh.p)pev, " 

{(%,x,j) : (4), (6), (9), (10), (12), (14)}. We also write y,, = 
{Vij,!,--- ,yij,Yij}, where Fy = \yij\. We now consider a 
new CSP, different from the naive one that would be directly 
obtained from Problem 1. We treat (f^, x^j) and as the 
variable and the finite set for edge (*,j) of the CSP. We write 
the clauses for {(fy,Xy)} as follows; 


4 (f*) = 


1, if(8)holdsVpGT’t, feT 
0, otherwise 


, i G V (25) 


4^ij • A; G T^}) — 


1, if (24) holds 
0, otherwise 

{i,j)G£,i^S (26) 


where = {iik)k^o k^i ■ Note that the local constraints in (4), 
(6), (9), (10), (12) and (14) (i.e., (3) and (7)) are considered 
in the finite set yij of the CSP with respect to each edge 
(i,j) G £■ On the other hand, the non-local constraints in (8) 
and (24) are considered in clauses cj){ in (25) and (pfj in (26), 
respectively. We thus have the following proposition. 

Proposition 2 (Edge-based CSP): The edge-based CSP with 
variables (fy , x^) G yij, {i,j) G £ and clauses (25) and (26) 
has considered all the constraints in Problem 1. 

Note that the number of variables (E) and the number of 
clauses (< V E — P) of the new CSP are much smaller 
than the number of variables (< (1DP -f TP)E) and the 
number of clauses (< (1 -f T -f TP)E -f TPV -f TPD) of the 


Algorithm 5 Edge-based CEL 


1: For all {i,j) G £, edge (i,j) initializes qij{y) = ^ for all 

yeTij- 

2: loop 

3: For all {i,j) G £, edge {i,j) realizes a random variable, 

selecting (fij,Xij) = y with probability qij(y), where y G 

yij- 

4: for (i,j) G f do 

5: Each edge (i,j) evaluates all the clauses in 4>ij. 

6: if all clauses in are satisfied then 


7: 


set qij (y) = 


1 , 

0, 


if y = 

otherwise 


else 

set qij (y) = 


(1 - b)qij(y) -b ■ 


Yij-l+a/6’ 

{l-h)qij{y) + ^-^j^, 
where a,b £ (6,1] are design parameters. 


end if 
end for 
end loop 


y = (fij,Xij) 
otherwise 


naive CSP directly obtained from Problem 1. This feature will 
favor the complexity reduction of a distributed solution based 
on the distributed CSP solver in [34]. 

Next, we construct the clause partition. The set of clauses in 
which variable (fy, x^) participates is 

= {</>{> '/'J} U ^ <S, A G O,} , (z, j) G f. 

(27) 

Then, the focus can be on the satisfaction of each variable 
(fy , Xy), i.e., the satisfaction of each set of clauses 4>y. Now, 
the new CSP can be solved using the distributed iterative CFL 
algorithm [34, Algorithm 1]. Specifically, each edge {i,j) G £ 
realizes a random variable selecting (fij,Xy). Allow message 
passing on (fij,Xij) between adjacent nodes to evaluate the 
related clauses. Based on whether the clauses in (27) are 
satisfied or not, the distribution of the random variable of each 
edge (i,j) G £ is updated. The details are summarized in Al¬ 
gorithm 5, which obtains a feasible solution to the edge-based 
CSP using CFL [34, Algorithm 1]. Based on the convergence 
result of CFL [34, Corollary 2], we know that Algorithm 5 can 
find a feasible solution to Problem 1 in almost surely finite 
time. Fig. 6 illustrates the convergence of Algorithm 5. From 
Fig. 6, we can see that Algorithm 5 converges to a feasible 
solution to Problem 1 for the network in Fig. 2 within 5000 
iterations. This feasible solution is the same as the one shown 
in Fig. 4, with network cost 11. 

Relying on Algorithm 5, we present an edge-based dis¬ 
tributed probabilistic algorithm. Algorithm 6, to solve Prob¬ 
lem 1." Since Algorithm 5 can find any feasible solution to 
Problem 1 with positive probability, Ui —> U*{{Vt}) almost 
surely as ( —oo, where Ui denotes the smallest network cost 
obtained by the first I edge-based CFLs. Fig. 7 illustrates the 
convergence of Algorithm 6. From Fig. 7, we can see that 
Algorithm 6 obtains the optimal network cost 11 to Problem 1 
for the network in Fig. 2 quite quickly (within 5 iterations). 

' * Note that Step 3 and Step 6 of Algorithm 6 can be implemented in similar 
ways to those in Algorithm 4. 
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Algorithm 6 Edge-based Distributed Algorithm 

1: / = 1 and Ui = +oo. 

2 : loop 

3: Run the edge-based CEL in Algorithm 5 to the edge-based CSP 

corresponding to Problem 1. Let : (i,j) € S} 

denote the feasible solution obtained by Algorithm 5 and let 
Ui denote the corresponding network cost. 

4: if IJi < Ui then 

5: set Ui+i = Ui, for all (i,j) € £, 

and 1 = 1 + 1. 

6: end if 

7: end loop 
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(a) Variable (f 25 ,X 25 ) for edge (2,5). 
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(b) Variable (f 57 ,X 57 ) for edge (5,7). 
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Fig. 6: Convergence of the edge-based CEL in Algorithm 5 for 
Problem 1 of the network in Pig. 2. a = 1 and b = 0.01. Note 
that for each edge, the “Correct Event” indicates the variable taking 
the value which corresponds to the feasible solution obtained by the 
edge-based CEL. These convergence curves are for one realization of 
the random Algorithm 5. 
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Fig. 7: Network costs of the edge-based CFLs in Algorithm 4 for 
Problem 1 of the network in Fig. 8. Each blue dot represents the 
network cost of a feasible solution obtained by the edge-based CEL 
in each iteration of Algorithm 4. While the red curve represents the 
minimum network cost obtained by Algorithm 4 within a certain 
number of iterations. The blue dots and red curve are for one 
realization of the random Algorithm 4. 


D. Comparison 

In this part, we compare the path-based and edge-based 
distributed algorithms. In obtaining a feasible solution to Prob¬ 


lem 1, the path-based CFL, i.e., Algorithm 3 and the edge- 
based CFL, i.e., Algorithm 5 both base on CFL [34, Algorithm 
1]. The convergence result of CFL [34, Corollary 2] guarantee 
that Algorithm 3 and Algorithm 5 both converge to feasible 
solutions to Problem 1 almost surely in finite time. However, 
Algorithm 3 converges much faster than Algorithm 5 in our 
simulations. This is expected, as Algorithm 3 solves a path- 
based CSP, while Algorithm 5 solves an edge-based CSP. The 
number of variables and the number of possible values for each 
variable for the path-based CSP are much smaller than those 
for the edge-based CSP. The difference in the convergence rates 
of Algorithm 3 and Algorithm 5 can be seen by comparing 
Fig. 4 and Fig. 6. On the other hand. Algorithm 3 requires more 
local information than Algorithm 5. In particular. Algorithm 3 
requires all the nodes on one path from a source node to a 
terminal node to be aware of their neighboring nodes on the 
path (not all the nodes on the path). Algorithm 5 instead only 
requires each node to be aware of its neighboring nodes. 

In obtaining an optimal solution to Problem 1 among mul¬ 
tiple feasible solutions, the path-based distributed algorithm, 
i.e., Algorithm 4 and the edge-based distributed algorithm, i.e., 
Algorithm 6 base on the path-based CFL, i.e., Algorithm 3 and 
the edge-based CFL, i.e., Algorithm 5, respectively, in the same 
way. Therefore, Algorithm 4 and Algorithm 6 share similar 
convergence properties. This can be illustrated in Fig. 5 and 
Fig. 7. 


VI. Numerical Illustration 

In this section, we numerically illustrate the performance 
of the proposed optimal solutions to Problems 1 and 2 using 
mixing only with the two-step mixing approach in [27] and 
optimal routing for general connections of integer flows. 

In the simulation, we consider the Sprint backbone network 
[35] as illustrated in Fig. 8. We choose sources S = {8,11} and 
terminals T C {2, 3,4, 6,9}. The edge directions are chosen 
to permit connections and help illustrate network coding gain. 
The green edges have edge cost 1, while the blue edges have 
edge cost 10 or 20. The edge costs are chosen to make the 
network coding advantage exist at least for some connection 
requests [31]. Note that network coding gain takes effect only 
if transmitting coded information requires a lower network cost 
than routing. We consider 1000 random realizations of demand 
sets. For each realization, a pair or triplet of terminals (i.e., 
T = 2, 3) are selected from {2, 3,4, 6,9} uniformly at random, 
and each selected terminal randomly, uniformly, independently 
demands a source out of the two sources in S = {8,11}. In 
addition, each selected terminal randomly chooses to demand 
the other source or not according to a Bernoulli distribution 
with probability g — 1 of selecting a second source, where 
q G [1,2]. Thus, q represents the expected number of sources 
selected by each terminal (i.e., Pt). Note that q = 2 indicates 
multicast, and q = 1 results in unicast connections. In this way, 
general connections are randomly generated with q controlling 
the average size of the intersections of the demand sets by 
different terminals. 
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Fig. 8: Sprint backbone network topology [35]. S = {8,11} and 
T C {2,3,4,6,9}. The edge costs are: 20 for edges (10,5) and 
(10,6), 10 for edge (9,4), and 1 for all the other edges. 



T=2 

T=3 


q=1.2 

q=1.8 

q=1.2 

q=1.8 

Problem 2 

7.49 

12.70 

12.82 

20.79 

Problem 1 

8.99 

13.80 

16.96 

23.20 

Two-step Mixing [27] 

8.99 

13.80 

17.24 

24.94 

Routing 

9.36 

18.68 

17.25 

32.34 


TABLE I: Average optimal network cost of the network in Fig. 8. 

A. Network Cost 

Table. I illustrates the average optimal network cost (aver¬ 
aged over 1000 random realizations) for different q and T. Note 
that the optimal network costs of Problems 1 and 2 are obtained 
by the centralized algorithm, i.e.. Algorithm 1. We can observe 
that the average optimal network costs of all the schemes 
increase with increases of q or T, i.e., the increase of network 
load. The average network costs of the optimal solutions to 
Problems 1 and 2 are lower than the optimal routing, with 
average cost reductions up to 28% and 36%, respectively. The 
average cost reductions are due to the network coding gain 
exploited by Problems 1 and 2. Specifically, edge (10, 7) can 
serve as the coding edge for the butterfly subnetwork consisting 
of nodes 6, 7, 8, 9, 10 and 11, and edges (7,4) and (4,1) can 
serve as the coding edge for the butterfly subnetwork consisting 
of nodes 1, 2, 3, 4, 6, 7 and 9, in the Sprint backbone network in 
Fig. 8. The network coding gain increases as g or T increases. 
This is because, using network coding, edges can be used more 
efficiently in the case of high network load. 

In addition, the average network costs of the optimal solu¬ 
tions to Problems 1 and 2 are lower than the two-step mixing 
approach, with average cost reductions up to 7% and 26%, 
respectively. The average cost reductions are due to the extra 
network coding gain (achieved through mixing) exploited by 
Problems 1 and 2. Specifically, given the demand sets of all 
the terminals, mixing or not in the two-step mixing approach 
(determined in the first step, separately from the second flow 
rate control step) is restricted by all the physical paths, while 
mixing or not in Problems 1 and 2 (determined jointly with 
flow rate control) is only restricted by the actual paths that 
each flow will take, which is also illustrated in the example 
in Fig. 2. Note that the average network cost of Problem 1 
is lower than the two-step mixing when T = 3. The average 
cost reductions of the optimal solutions to Problems 1 and 2 
increase as T increases, as there are more physical paths to 


terminals restricting network coding (mixing) in the two-step 
mixing approach. 

On the other hand, the average network cost of the optimal 
solution to Problem 2 is lower than that of the optimal so¬ 
lution to Problem 1, with average cost reduction up to 24%, 
illustrating the consequence of Lemma 1. For a given T, the 
performance gain of Problem 2 over Problem 1 decreases as q 
increases, since the difference between the feasibility regions of 
the two problems reduces with the increase of q. Note that when 
q = 2 (i.e., multicast), the two problems (feasibility regions) 
are the same. However, for a given q, the performance gain of 
Problem 2 over Problem 1 increases as T increases, since the 
difference between the feasibility regions of the two problems 
increases with the increase of T. 

B. Convergence 

We illustrate the convergence performance of own distributed 
Algorithm 4. Consider si = 8, S 2 = 11, = 2, t 2 = 6, 

Vi = {1,2}, V 2 = {2} and V = {1,2}. In this case, the 
optimal network costs of Problem 2, Problem 1, the two-step 
mixing approach and routing are 10, 28, 28, 28, respectively. 
The optimal network mixing solution to Problem 2 is achieved 
through the demand set expansion, i.e., 'Pi = 7^2 = T’ = {1, 2}. 
The expanded demand set corresponds to multicast, where the 
network coding gain is achieved. The optimal mixing (coding) 
solutions with cost 10 corresponds to flow paths 8 — 10 — 7 — 
4 - 1 - 2, 11 - 10 - 7 - 9 - 2 (11 - 9 - 2), 8 - 6 and 
11 — 10 — 7—6. There is no network mixing (coding) solution to 
Problem 1 and the two-step mixing approach. There are three 
optimal routing solutions of cost 28, which are also optimal 
(feasible but non-coding) solutions for Problem 1 and the two- 
step mixing approach. The first one corresponds to flow paths 
8-10-5-1-2,11-10-7-9-2 and 11-10-7-6. The 
second one corresponds to flow paths 8 — 10 — 7 — 4—1 — 2, 
11 — 9 — 2 and 11 — 10 — 6. The third one corresponds to flow 
paths 8 - 10 - 5 - 1 - 2, 11 - 9 - 2 and 11 - 10 - 7 - 6. 
In the following, we illustrate the convergence for the path- 
based and edge-based distributed algorithms for Problem 2 (at 
■Pi = 'P 2 = P = {1, 2}), respectively. 

1) Path-based Probabilistic Distributed Algorithm: Fig. 9 
illustrates the convergence of Algorithm 3 (i.e.. Step 3 in 
Algorithm 4). From Fig. 9, we can see that Algorithm 3 
converges to a feasible solution to Problem 2 quite quickly 
(within 25 iterations). This feasible solution corresponds to flow 
paths 8-10-7-4-1-2, 11-10-7-9-2, 8-10-7-6 
and 11 — 10 — 6 . The network cost of this feasible solution 
is 10, i.e., the optimal network cost to Problem 2. Fig. 10 
illustrates the convergence of Algorithm 4 for one instance. 
We can see that there exist multiple feasible mixing solutions 
to Problem 2, which are of different network costs, and running 
Algorithm 3 for multiple times can result in different feasible 
solutions. Thus, the minimum network cost may decrease as 
the number of iterations increases. Algorithm 4 obtains the 
optimal network cost 10 to Problem 2 quite quickly (within 
100 iterations). Fig. 11 illustrates the average convergence 
of Algorithm 4 over 1000 instances. We can see that on 
average, within 100 iterations, the minimum network cost under 
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(b) Flow path from (source) node 11 to (terminal) node 2. 
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(d) Flow path from (source) node 11 to (terminal) node 6. 


Fig. 9: Convergence of the path-based CFL in Algorithm 3 for 
Problem 2 of the network in Fig. 8. a = 0.05 and b = 0.009. These 
convergence curves are for one realization of the random Algorithm 3. 
Note that all the flow paths are shown in the figure. 



Iterations 

Fig. 11: Average minimum network costs of the path-based CFLs 
in Algorithm 4 for Problem 2 of the network in Fig. 8 over 1000 
instances. The red curve here represents the average of the red curves 
in Fig. 10 over 1000 instances. 
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Fig. 10: Network costs of Algorithm 4 for Problem 2 of the network in 
Fig. 8. Each blue dot represents the network cost of a feasible solution 
obtained by the path-based CFL in each iteration of Algorithm 4. 
While the red curve represents the minimum network cost obtained 
by Algorithm 4 within a certain number of iterations. The blue dots 
and red curve are for one realization of the random Algorithm 4. 


Algorithm 4 converges to 10, which is the optimal network 
cost to Problem 2 obtained by the centralized algorithm in 
Algorithm 1. 

2) Edge-based Probabilistic Distributed Algorithm: Fig. 12 
illustrates the convergence of Algorithm 5 (i.e., Step 3 in 
Algorithm 6). From Fig. 12, we can see that Algorithm 5 
converges to a feasible solution to Problem 2 within 8000 


Fig. 12: Convergence of the edge-based CFL in Algorithm 5 for 
Problem 2 of the network in Fig. 8. a = 1 and b = 0.01. Note that 
for each edge, the “Correct Event” indicates the variable taking the 
value which corresponds to the feasible solution obtained by the edge- 
based CFL. These convergence curves are for one realization of the 
random Algorithm 5. 



Fig. 13: Network costs of Algorithm 6 for Problem 2 of the network in 
Fig. 8. Each blue dot represents the network cost of a feasible solution 
obtained by the edge-based CFL in each iteration of Algorithm 6. 
While the red curve represents the minimum network cost obtained 
by Algorithm 6 within a certain number of iterations. The blue dots 
and red curve are for one realization of the random Algorithm 6. 
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Fig. 14: Average minimum network costs of the edge-based CFLs 
in Algorithm 6 for Problem 2 of the network in Fig. 8 over 1000 
instances. The red curve here represents the average of the red curves 
in Fig. 13 over 1000 instances. 


iterations. This feasible solution is the same as the one shown in 
Fig. 9, with network cost 10. By comparing Fig. 12 with Fig. 9, 
we can see that Algorithm 5 converges much more slowly than 
Algorithm 3. Fig. 13 illustrates the convergence of Algorithm 6. 
We can see that there exist for Problem 2, multiple feasible mix¬ 
ing solutions which have different network costs, and running 
Algorithm 5 for multiple times can result in different feasible 
solutions. Thus, the minimum network cost may decrease as the 
number of iterations increases. Algorithm 6 obtains the optimal 
network cost 10 to Problem 2 within 100 iterations. Fig. 14 
illustrates the average convergence of Algorithm 6 over 1000 
instances. We can see that on average, within 300 iterations, 
the minimum network cost under Algorithm 6 converges to 10, 
which is the optimal network cost to Problem 2 obtained by 
the centralized algorithm in Algorithm 1. By comparing Fig. 14 
with Fig. 11, we can see that Algorithm 6 converges much more 
slowly than Algorithm 4. 

VII. Conclusion and Future Work 

In this paper, we introduce linear network mixing coefficients 
for code constructions of general integer connections. For such 
code constructions, we pose the problem of cost minimization 
for the subgraph involved in the coding solution, and relate 
this minimization to a path-based CSP and an edge-based CSP, 
respectively. We present a path-based probabilistic distributed 
algorithm and an edge-based probabilistic distributed algo¬ 
rithm with almost sure convergence in finite time by applying 
CFL. Our approach allows fairly general coding across flows, 
guarantees no greater cost than routing, and demonstrates a 
possible distributed implementation. Numerical results illustrate 
the performance improvement of our approach over existing 
methods. 

This paper opens up several directions for future research. 
For instance, the proposed optimization-based linear network 
code construction for general integer connections can be ex¬ 
tended to design route finding protocols of superior perfor¬ 
mance for general connections. In addition, a possible direction 
for future research is to design dynamic approaches not only to 
build new subgraphs, but also to update them as they evolve, 
so as to reflect changes in topologies for varying networks, as 
occur in such settings as peer-to-peer (P2P) networks. Another 
interesting extension of the proposed approach to content¬ 
centric cache-enabled networks is to incorporate cache place¬ 
ment (which creates new sources) into the cost minimization 


for the subgraph involved in the coding solution in this work. 
Finally, the proposed approach for wireline networks can also 
be generalized to wireless networks by considering hyper edges 
to model broadcast links. 
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Appendix A; Proof of Theorem 1 

Let z, X, (3 and f denote a feasible solution to Problem 1. 
Note that x is uniquely determined by f3 according to (10) and 
(11), which correspond to Conditions 1) and 2) in Definition 
2. In addition, by (12), which corresponds to Condition 3) in 
Definition 2, we know that x ensures that for each terminal, 
the extraneous flows are not mixed with the desired flows on 
the paths to the terminal. We shall show that, based on /3, we 
can And local coding coefficients a, which uniquely determine 
feasible global coding coefficients c according to 

Cspi = Gp, (Sp,j) €£, p&V (28) 

Cij — y ^ (lj) C z ^ S (29) 

fceii 

where correspond to Conditions 1) and 2) in Definition 1). 


First, we choose akij = 0 if Pkij = 0. Note that, as a feasible 
solution, X is uniquely determined by /3 according to (10) and 
(11). In addition, we choose c based on a according to (28) 
and (29). Thus, by (28), (29), (10) and (11), we can show that 
Cij^p = 0 if Xij^p = 0 by induction. Thus, by (12), we have 

Cit,p = 0, is It, P ^ r"*, t (z T. (30) 

In other words, each terminal t € T only needs to consider 
pgp^ for decoding. By (8), we can form a flow path 
from source Sp to terminal t, which consists of the edges in 
4 = {(*, j) e £ ■■ flj^p = 1}, where p S Vt- By (3) and (7), 
we know that for all t G T, there exists Pt edge-disjoint unit 
flow paths, each one from one source Sp to terminal t, where 
p € Vt- Note that x satisfies all the conditions in Definition 
2. Thus, by (9), we know that all the flow paths satisfy that 
for each terminal, the extraneous flows (information) are not 
mixed with the desired flows (information) on the flow paths 
to the terminal. Let At denote the Pt x Pt matrix, each row 
(out of Pt rows) of which consists of the Pt elements in 
{Cit,p)p£Vt (*if) 

Pt flow paths) to terminal t, where i G It- Note that At 
(in terms of {cit,p)i^it peVt paths) can also 

be expressed in terms of local coding coefficients ot by (28) 
and (29).'^ By (28) and (29), we know that 1) and 2) of 
Definition 1 are satisfied. Therefore, it remains to show that 3) 
of Definition 1 is satisfied. This can be achieved by choosing 
{akij ■ ihj) G £,l3kij 0} so that At for alH G T are 
full rank, i.e., 7^ 0 [36, Pagesl9-20]. 

Next, we show that if F > T, we can choose {akij ■ 
G £,f3k^j ^ 0} such that liter ^ 0. 

We first show that for all t G T, det(At) is not identically 
equal to zero. For all p G Vt and t G T, choose akij = 1 
for all edges G £ on the flow path from source 

Sp to terminal t, i.e., {k,i),{i,j) G Cp, and akij = 0 for all 
edges {k,i), {i, j) G £ not on the same flow path, i.e., {k,i) 
or ihj) ^ 4' local coding coefficient assignment makes 
At the Pt X Pt identity matrix. Thus, det(At) is not identically 
equal to zero [36, Page 20]. Then, we show that ]^jg.y-det(At) 
is not equal to zero, using the algebraic framework in [36, Pages 
31-32]. Similarly to the proof of Theorem 3.2 in [36, Pages 
31-32], we can show that ]^jg.y-det(At) is a polynomial in 
unknown variables {akij ■ {k,i), {i,j) G £,/3kij 7 ^ 0} and that 
the degree of each unknown variable is at most T. Therefore, 
by Lemma 2.3 [36, Page 21], we can show that, for F > T, 
there exists a choice of {akij ■ {k,i),{i,j) G £,f3kij 7 ^ 0} 
such that ritGTr[®t(^t) 7^ 0- Recall that akij = 0 if j3kij = 0. 
Therefore, based on /3, we can obtain a that leads to feasible 
c. 


'^Given all the local coding coefficients a, we can compute global coding 
coefficients c, and vice versa. 
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